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DETECTION METHOD FOR C-RAF-1 GENES 

5 



Field of the Invention 

1 o The present invention relates to ( 1 ) a method of 

identifying an individual at an increased risk for 
developing cancer, (2) a method for determining a 
prognosis of patients afflicted with cancer, and (3) a 
method for determining the proper course of treatment for 

15 a patient afflicted with cancer. 

Background Information 

Lung cancer claims more lives in the United 

20 States than any other neoplasm (R.S. Finley, Am. Pharm. 
NS29, 39 (1989)), and of the various forms lung 
adenocarcinomas have one of the worst prognoses (T.P. 
Miller, Semin. Oncol. 17, 11 (1990)). The incidence of 
adenocarcinoma of the lung (ACL) in the United States is 

25 also quickly rising (I. Linnoila, Hematol. Oncol . North . 
Am. 4, 1027 (1990); J.B. Sorensen, H.H. Hansen, Cancer 
Surviv . 8, 671 (1989)). In order to gain insight into 
this complex and deadly disease, a model system for its 
study has been developed. For such a model to provide 

30 clinically relevant data several criteria must be met. 

The tumors produced should be histologically equivalent to 
their human counterparts, tumor induction must be reliable 
and reproducible, and the numbers generated must be great 
enough to provide statistical significance. To satisfy 

35 these conditions a system has been created which uses two 
inbred mouse strains (NFS/n and AKR) along with 
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15 



transplacental exposure to the potent carcinogen N-ethyl- 
N-nitrosourea (ENU) and promotion with the antioxidant 
butylated hydroxytoluene (BHT) . The resulting tumors were 
examined for altered expression or structural mutations of 
5 genes implicated in lung tumor development such as rasi. 
mvc, and raf oncogenes (CD. Little et al., Nature 306, 
194 (1983); P.E. Kiefer et al., Cancer Res,., 47, 6236 
(1987); E. Santos et al., Science 223, 661 (1984); S. 
Rodenhuis, » «™i .t. Med. 317, 929 (1987); M. Barbacid, 
10 — -t TnvMt. 20, 225 (1990); U.R. Rapp et al. , i. 

Tn f* A Ssne . «-h* Study of T.ung Cancer 4, 162 (1988); 

M.J. Birrer et al., Ann Rev. Med. 40, 305 (1989); G. 
Sithanandam et al., Oncogene 4, 451 (1989)). 

raf proto-oncogenes are evolutionarily highly 
conserved genes encoding cytoplasmic serine/ threonine 
specific kinases, which function in mitogen signal 
transduction (reviewed in U.R. Rapp et al., The Oncogene 
Hgndfeook, T. Curran et al., Eds. (Elsevier Science 
20 Publishers, The Netherlands, 1988), pp. 115-154; U.R. 
Rapp , oncogene 6, 495 (1991)). The three known active 
members in the raf family encode phosphoproteins of 
similar size (72/74 kD for Raf-1 ; 68 kD for A-Raf-1 , and 
74 kD for B-Raf (U.R. Rapp et al., in Pe^ovi ruses and 
25 »» ma n P^holoav . R. Gallo et al., Eds. (Humana Press, 

Clifton, New Jersey 1985), pp. 449-472; T.W. Beck et al., 
vwo 0 ^ Arvids Res. 15, 595 (1987); G. Sithanandam et al., 
oncogene 5, 1775 (1990))). Raf-1 was first identified as 
the cellular homologue of v-raf (H.W. Jansen et al., 
30 Nature 307, 218 (1984)), the transforming gene of 3611 MSV 
(U.R. Rapp et al., »T- Virpl. 45, 914 (1983); U.R. Rapp et 
al , — Ac -* " SA 80 ' 4218 < 1983 >>- A,Bino 

acid comparisons of raf family genes shows three conserved 
regions [CR1 , CR2, CR3] (T.W. Beck et al., Nucleic Acids 
35 Res. 15, 595 (1987)); CR1 is a regulatory region 

surrounding a Cys finger consensus sequence, CR2 is a 
serine/threonine rich region, and CR3 represents the 
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kinase domain. Raf-1 has been mapped to chromosome 3p25 
in humans (S.J. O'Brien et al., Science 223, 71 (1984)), 
and this region has been found to be frequently altered in 
small cell lung carcinoma (SCLC) (J. Whang-Peng et al., 
5 Cancer Genet. Cvtocrenet. 6, 119 (1982); J.M. Ibson et al. , 
J. Cell. Biochem. 33, 267 (1987)), familial renal cell 
carcinoma (A.J. Cohen et al-, N. Enol. J. Med. 301, 592 
(1979); G. Kovacs et al., Int. J, Cancer 40, 171 (1987)), 
mixed parotid gland tumors (J. Mark et al., Hereditas 96, 
10 141 (1982)), and ovarian cancer (K. Tanaka et al., Cancer 
Genet. Cvtoaenet. 43, 1 (1989)). 

Raf genes are differentially expressed in 
various tissues (S.M. Storm et al., Oncogene 5, 345 

15 (1990)). c- raf- 1 has been found to be expressed 

ubiquitously, though absolute levels vary between tissues. 
A- raf- 1 is present predominantly in the urogenital 
tissues, whereas B -Raf is most abundant in cerebrum and 
testis. The ubiquitous c-Raf-1 kinase is regulated by 

20 tyrosine and serine phosphorylations that result from 

activated growth factor receptor kinases (D.K. Morrison et 
al., Cell 58, 648 (1989); D.K- Morrison et al., Proc. 
Natl. Acad. Sci. USA 85, 8855 (1989); K.S. Kovacina et 



J. 


Biol. 


Chem. 


265, 


12115 


(1990); 


P. 


J. 


Blackshear et 


J. 


Biol . 


Chem . 


265, 


12131 


(1990); 


M. 


P. 


Carroll et 


J. 


Biol. 


Chem. 


265, 


19812 


(1990); 


J. 


N. 


Siegel et al . , 



.7- Biol. Chem. 265, 18472 (1990); B.C. Turner et al., 
Proc. Natl A^ari. Sci. USA 88, 1227 (1991 ); M. Baccarini 
et al., tsmbo J. 9, 3649 (1990); H. App et al., Mol. Cell. 

30 Biol . 11,- 913 (1991)). Raf-1 operates downstream of Ras 
in mitogen signal transduction as judged by experiments 
using antibody microinjection (M.R. Smith et al., Nature 
320, 540 (1986)), c- raf- 1 antisense expression constructs 
(W. Kolch et al., Nature 349, 426 (1991)), dominant 

35 negative mutants (W. Kolch et al., Nature 349, 426 

(1991)), and Raf revertant cells. Studies with NIH3T3 
cells and brain tissue demonstrated that mitogen treatment 
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induces Raf-1 kinase activity and causes a transitory 
relocation of the active enzyme from the cytoplasm to the 
nucleus and perinuclear area (Z. Olah et al., Exp. Brain. 
Res . (in press); U.R. Rapp et al., in rolrl Spring Harbor 
5 Br r .^ on on,- -*-™™ Bioloov , Vol. LIII, Eds. (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
1988) pp. T73-184). 

Raf-1 coupling has been examined in more than a 
10 dozen receptor systems and all strong mitogens stimulated 

Raf-1 kinase activity (U.R. Rapp, Oncogene 6, 495 (1991); 

^ Morrison et al., Ceil 58, 648 (1989); D.K. Morrison 

e 't al , — Ac * - PSA 85, 8855 (1989); K.S. 

Kovaclna et al., ,T Biol. Chem.. 265, 12115 (1990); P.J. 
15 Blackshear et al., ,T Biol , Chem. 265, 12131 (1990); M.P. 

Carroll et al., .T Biol . Chem. 265, 19812 (1990); J.N. 

Siegel et al., ,T Biol. Chem.. 265, 18472 (1990); B.C 

Turner et al. , Prnr Nnt~1 Ar^ ■ Sci. PSA 88, 1227 (1991 ); 

M Baccarini et al., EMBO J.. 9, 3649 (1990); H. A PP et 

20 al., Biol. 11 ' 913 < 1991)) ' and th±S 

stimulation correlated with an increase in gaf-1 
phosphorylation leading to a shift in apparent molecular 
weight . 

25 SUMMARY OF THE INVENTION 

It is an object of this invention to provide a 
method of identifying an individual at an increased risk 
for developing cancer. 

30 

It is another object of this inventxon to 
provide a method for determining a prognosis in patients 
afflicted with cancer. 

35 It is a further object of this invention to 

provide a method for determining the proper course of 
treatment for a patient afflicted with cancer. 
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Further objects and advantages of the present 
invention will be clear from the description that follows. 

In one embodiment, the present invention relates 
5 to a method of identifying an individual at an increased 
risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene; 
analyzing products of the amplification for evidence 
of mutation; and 
10 classifying an individual having one or more 

mutations in the region as having an increased risk for 
developing cancer. 

In another embodiment, the present invention 
15 relates to a method for determining a prognosis in 
patients afflicted with cancer, comprising: 

amplifying a region of the c-raf-1 gene; 
analyzing products of the amplification for evidence 
of mutation; and 
20 classifying patients having no mutation in said 

region as being less likely to suffer disease relapse or 
having an increased chance of survival than those patients 
having one or more mutations in said region. 

25 In a further embodiment, the present invention 

relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer, comprising: 
amplifying a region of the c-raf-1 gene; 
analyzing products of said amplif ication for evidence 

30 of mutation; 

identifying a patient having at least one mutation in 

said region, which patient may require treatment proper 

for patients having a lesser chance of survival or 

decreased time to relapse; and 

35 identifying a patient lacking mutations in said 

region, which patients may require treatment proper for 
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patients having a greater chance of survival or being less 
likely to suffer disease relapse. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 

Figure 1 . Effect of BHT promotion on ENU 
tumorigenesis in NFS/n x AKR mice. The X-axis represents 
percent tumor induced mortality within each group, and the 
Y-axis reflects age in weeks. All animals were exposed to 

10 ENU transplacental^ at a dose of 0.5mM/Kg of mother's 
body weight on day 16 of gestation (presence of vaginal 
plug was scored as day one). At two weeks of age mice 
were weaned into two separate groups and separated by sex. 
Both groups were housed in identical cages and supplied 

15 with food (Purina Lab Chow) and water ad. libitum . 

Beginning at three weeks of age, group 2A (0) was given 
weekly intraperitoneal (i.p. ) injection of corn oil (0.1 
ml), and group 2B (o) received weekly i.p. injections of 
BHT (20 mg/Kg of body weight) dissolved in corn oil. 

20 Administration of BHT reduces the mean age of mortality 
from approximately 20 weeks to 13, and decreases the 
initial age of mortality. These curves are significantly 
different (ps 0.001) as judged by a 2-tailed Cox test. In 
both groups the rate of tumorigenesis was identical for 

25 males and females. 

Figure 2. Northern blot analysis of proto- 
oncogene expression levels in ENU induced tumors. 

Figure 3. Diagnostic digestion of PCR amplified 
Ki-ras. genes from ENU induced tumors. Genomic DNA was 
isolated from a cesium chloride gradient during RNA 
preparations. In each case 10 ng was amplified via PCR 
(95°C, 5 min. followed by 35 cycles of 95°C, 1 min. -> 
35 55°C, 1 min. -» 72°, 1 min.) with 2 units of Tag I 
polymerase. The primers used (K1 ; 5'- 

AACTTGTGGTGGTTGGACCT— 3 ' -* (SEQ ID NO: 6) and K2; <= 3'- 



30 
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GTCTTAGTGAAACACCTACT-5 ' (SEQ ID NO: 7)) generate a 79 b.p. 
product. The primer K1 ends at codon 12 and contains a 
mismatch from normal mouse (Ki-ras sequence its 18th 
nucleotide (G -> C) creating a BstNI site (CCTGG) in 
5 conjunction with a normal codon 12 (GGT) . Digestion of 
amplified product from a normal allele with BstNI produces 
two products of 19 and 60 b.p., whereas a mutation in one 
of the first two positions of codon 12 will eliminate the 
BstNI site. The presence of two normal alleles results in 

10 all of the product being cleaved and the presence of one 
mutant and one normal allele will result in only half of 
the product cut. In the three panels each sample was run 
in duplicate, uncut and cut with BstNI. F1 is DNA from an 
untreated NFS/n X AKR F1 mouse, and MCA 5 is a murine cell 

15 line known to harbor a mutant K-ras codon 12 allele. One 
lymphoma (24Ly) and one cell line (117; derived from a 
lung adenocarcinoma) display a mutated Ki-ras codon 12 
allele; however, 24Ly was a passaged tumor and examination 
of the original tumor showed two normal alleles indicating 

20 that this mutation was acquired during passaging. 

Figure 4. c-raf-1 RNAse protection analysis of 
ENU induced tumors. The probe used was a 32P labeled 
antisense transcript from the 3' non-coding region of a 

25 mouse c -raf- 1 cDNA to the 3' most StuI site. 

Hybridization of this probe with normal RNA results in a 
protected fragment of 1 . 2kb covering the region encoding 
the Raf-1 kinase domain. One yig of poly (A) + RNA from each 
tumor and 5 >ig of F1 RNA (in order to get comparable 

30 signals) was hybridized for 12 hours at 52°C with 200,000 
cpm of labeled mouse c -raf antisense transcript. 
Hybrids were then digested for 30 minutes with 25 iig RNAse 
A and 33 units of RNAse Tl at room temperature. Digested 
hybrids were then, incubated with 50 >ig of proteinase K, 

35 phenol/chloroform extracted, ethanol precipitated, and 
resuspended in loading dye containing 80% formamide. 
Samples were then run on 6% polyacrylamide denaturing 
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sequencing gels at 65 watts. Gels were vacuum dried at 80 
degrees C and exposed to X-ray film. Probe is undigested 
probe alone; tRNA is probe hybridized to non-specific RNA; 
v-raf is probe hybridized to RNA from a v-raf transformed 
5 cell line and the bands detected represent single base 
mismatches between murine c-raf and v-raf; NFS/AKR F1 is 
probe hybridized with RNA from a normal (untreated) mouse; 
24 LY is probe hybridized with RNA from a lymphoma; and 
the remaining lanes are probe hybridized with RNA isolated 
10 from lung tumors. 

Figure 5. Schematic of Raf-1 protein showing 
sites of ENU induced mutations. CR1 , CR2, and CR3 
represent conserved regions 1, 2 and 3. cDNAs were made 

15 from tumor derived poly(A) + RNA using MoMuLV reverse 

transcriptase. Primers (MR1 sequence and MR2 sequence) 
encompassing a 435 base pair region c-raf were then used 
to amplify this region via PGR. The amplification mixture 
was then run on 1.7% agarose gels and the 435 bp Product 

20 isolated. This isolated fragment was then treated with T4 
polymerase and cloned into the Hindi site of M13mp18 for 
sequencing. Another set of primers (EMR1 sequence and 
EMR2 sequence) was designed containing EcoRI sites at the 
termini and used to amplify a 609 base pair region 

25 (encompassing the original 435 base pair region) 

isolated products from these reactions were then digested 
with EcoRI and cloned into the EcoRI site of KS. 
Sequencing reactions were carried out using the Sequenase 
kit (USB) according to the recommended protocols for 

30 single and double stranded sequencing. Sequencing 

reactions were run on 6% polyacrylamide denaturing gels at 
65 watts. Gels were vacuum dried at 80 degrees C and 
exposed to X-ray film. In each case a normal allele was 
also sequenced along with the mutant allele. 
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Figure 6- Schematic for Identifying c-raf-1 
mutations. Primers 1 and 2 are shown in SEQ ID NO: 8 and 
SEQ ID NO: 9, respectively. 



DETAILED DESCRIPTION OF THE INVENTION 



The present invention relates to methods that 
involve amplifying a region of the c-raf-1 gene (the 
sequence of a mouse c-raf-1 gene is shown in SEQ ID NO: 10; 
10 the nucleotide and corresponding amino acid sequence of a 
human c-raf-1 gene is shown in SEQ ID NO: 11 and SEQ ID 
NO : 1 2 , respectively ) . 

In one embodiment, the present invention relates 

15 to a method of identifying an individual at an increased 
risk for developing cancer (preferably, lung cancer, T- 
cell lymphomas, renal cell carcinoma, ovarian carcinoma, 
and mixed parotid gland tumors) comprising: amplifying a 
region (preferably by using the polymerase chain reaction 

20 method (PCR) .or by cloning techniques) of the c-raf-1 gene 
of the individual (SEQ ID NO:11)(in one preferred 
embodiment, the region encodes amino acids 514 to 535 of 
SEQ ID NO: 12); analyzing products of the amplification for 
evidence of mutation (preferably by DNA sequencing of the 

25 region) and classifying an individual having one or more 
mutations in the region as having an increased risk for 
developing cancer. In one prefered embodiment, the region 
encodes amino acids 500 to 550 of SEQ ID NO: 12 or amino 
acids 450 to 630 of SEQ ID NO: 12. In another prefered 

30 embodiment, the PCR method employs a primer comprising the 
sequence shown in SEQ ID NO: 7 and a primer comprising the 
sequence shown in SEQ ID NO: 8. In another prefered 
embodiment, the method comprises the steps shown in Figure 
6. 

35 in another embodiment, the present invention 

relates to a method for determining a prognosis in a 
patient afflicted with cancer (preferably, those cancers 
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listed above). The method comprises: amplifying the 
region of the c-raf-1 gene as described above; analyzing 
products of the amplification for evidence of mutation as 
described above; and classifying a patient having no 
5 mutation in the region as being less likely to suffer 

disease relapse or having an increased chance of survival 
than a patient having one or more mutations in the region. 

In another embodiment, the present invention 
10 relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer (preferably, 
those cancers listed above), comprising: amplifying a 
region (described above) of the c-raf-1 gene as described 
above; analyzing products of the amplification for 
15 evidence of mutation as described above; identifying a 

patient having at least one mutation in the region, which 
patient may require treatment proper for patients having a 
lesser chance of survival or decreased time to relapse; 
and identifying^ patient lacking mutations in the region, 
20 which patients may require treatment proper for patients 
having a greater chance of survival or being less likely 
to suffer disease relapse. 

Administration of therapeutic agents (cytotoxic 
25 or cytostatic) tailored to recognize the mutant Raf-1 
protein but not normal Raf-1 could specifically target 
tumor cells for death of growth inhibition. Such agents 
could be comprised of cytotoxic T-cells, antibodies, 
and/or specifically designed chemical compounds. 

30 . ^ , 

The following Examples demonstrate consistent 

point mutations of the c-raf-1 proto-oncogene , within a 

small region of the kinase domain, in a mouse model for 

chemical tumor induction. This is the first demonstration 

35 of point mutated raf genes in vivo, and the first 

isolation of activating in vivo point mutations in the 

kinase domain of a proto-oncogene. The tumors examined 



WO 93/06248 



PCT/US92/07817 



-11- 

show a selective specificity for Raf-1 mutations as 
another family of genes, the ras proto-oncogenes which are 
frequently activated by point mutation in both animal and 
human tumors (S. Rodenhuis et al., Am. Rev. Respir. Pis. 
5 142, S27-30; T.R. Devereux et al., Carcinogenesis 12 f 299 
(1991)), is not involved. 

The present invention is described in further 
detail in the following non-limiting examples. 

10 

EXAMPLES 

The following protocols and experimental details 
are referenced in the examples that follow: 

15 RNA Isolation. Tumors were excised, a small portion 
minced in PBS (phosphate buffered saline solution) for 
passaging in nude mice, frozen immediately in a dry 
ice/ethanol bath, and stored at -70° until RNA extraction. 
Frozen tissues were minced on wet ice in a guanidine 

20 thiocyanate buffer (4M guanidine thiocyanate 1 OmM EDTA, 2% 
N-lauryl sarcosine, 2% beta-mercaptoethanol, 1 OmM Tris 
(pH=7.6)), disrupted in a Dounce homogenizer, and 
extracted three times with phenol: chloroform: isoamyl 
alcohol (24: 24: 2) • Supernatants were then transferred to 

25 SW41 tubes, 100 }ig of cesium chloride per ml was added to 
the supernatant which was then under lay ed with one half 
saturated cesium chloride in 1 OmM EDTA (pH=7.0; index of 
refraction 1.3995-1.4000), and centrifuged at 25,000 rpm 
for 20 hours in a Sorvall SW-41TI rotor using a Beckman 

30 model L5-50 ultracentrifuge . Supernatants were removed 
and RNA pellets dissolved in 4 ml resuspension buffer ( 1 0 
mM Tris-HCl pH=7.6, 5% beta-mercaptoethanol, 0.5% N-lauryl 
sarcosine, 10 mM EDTA), extracted once with 
phenol: chloroform: isoamyl alcohol, sodium acetate added to 

35 0.1 2M and RNA precipatated with two volumes ethanol at - 

20°C overnight. Precipitates were centrifuged at 9,000 rpm 
in a Sorvall SS-34 rotor for 30 minutes, and pellets 
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mM 



redissolved in RNA sample buffer (10 mM Tris pH=7.4, 1 
EDTA, 0.05% sodium dodecyl sulfate) and concentrations 
determined by absorbance at 260 nm. Poly (A)* RNA was 
isolated by binding to oligo dT cellulose columns in high 
5 salt nOmM Tris pH=7.4, 1mM EDTA, 0.05% SDS, 500mM NaCi), 
and eluting with RNA sample buffer heated to 40°C. 

Rlnhtinc. 5 ug poly (A)* RNA per lane was ethanol 
precipitated, desiccated, resuspended in loading buffer 

10 (20mM MOPS pH=6.8, 5mM sodium acetate, ImM EDTA, 50% 

formamide, 6% formaldehyde), heated at 65°C for 5 min., 
quick chilled on wet ice for 10 min., and electrophoresed 
through a 0.7% agarose gel containing 2.2 M formaldehyde, 
20mM MOPS [pH=6.8], 5mM sodium acetate, and ImM EDTA. 

15 Gels were then blotted overnight onto nitrocellulose 

filters via capillary transfer in 20X SSC, filters were 
washed in 3X SSC for 10 min. and baked at 80 °C for 2 



hours . 



^-•^^tions. Filters were prehybridized at 42°C in 5X 
SSC, 50% formamide, 20mM sodium phosphate P H=6.8, 200 
ug/ml PVP-40, 200 ug/ml ficoll 400, 200 ug/ml bovine serum 
albumin, and 200 ug/ml sonicated sheared salmon sperm DNA. 
Blots were. then hybridized with 500,000 cpm/ml of random 
25 primed *P labeled probes overnight at 42 °C in 

prehybridization solution with 5% dextran sulfate. Blots 
were washed with agitation in 2X SSC, 0.1% SDS at room 
temperature six times for 20 minutes each wash, then 
washed once at 45°C in 0.1X SSC for 15 minutes. Filters 
were exposed to X-AR 5 film at -70°C. 



KX AMPLE 1 
Tumor Tnduction 



NFS female mice were mated with AKR males and 
pregnant females given a transplacental injection of 1- 
ethy 1-1 -nitrosourea (ENU) at a dosage of 0.5 mM/Kg 
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mother's body weight on day 16 of gestation, counting plug 
date as day one. ENU was chosen for tumor induction since 
it is a very potent direct acting carcinogen capable of 
modifying any base in vivo (Singer, B. et al., 1983. 
5 Molecular Biology of Mutagens and C arcinogens, Plenum 

Press, New York). ENU alkylates all tissues with roughly 
the same efficiency (E. Scherer et al., Cancer Lett. 46, 
21 (1989)) and has a very short half life in vivo (E.M. 
Faustman et al.. Teratology 40, 199 (1989)) allowing 

10 specific mutagenesis of tissues which are mitotically 

active at a particular time. NFS and AKR were chosen as 
parental strains based on earlier studies which showed 
them to be particularly susceptible to ling tumors 
following ENU exposure (B. A. Diwan et al., Cancer Res. 34, 

15 764 (1974); S.L. Kauffman, JNCI 57, 821 (1976)). With 
this procedure nearly 100% of the offspring develop lung 
adenocarcinomas and approximately 7 0% develop, in 
addition, T-cell lymphomas with a mean latency of 
approximately 20 weeks. In order to achieve more rapid 

20 tumor development, weanling mice were treated with weekly 
injections of a tumor promoter, the antioxidant butylated 
hydroxytoluene or BHT (20mg/kg body weight dissolved in 
corn oil). BHT was used as it has been demonstrated to 
cause lung, lesions and hyperplasia when injected into mice 

25 (A. A. Marino et al., Proc. Soc . Exp. Biol. Med. 140, 122 
(1972); H. Witschi et al., Proc. Soc. Exp. Biol. Med,, 
147, 690 (1974); N. Ito et al., CRC Crit. Rev. Toxicol. 
15, 109 (1984)). In the present system it nearly doubles 
the rate at which tumors develop. Figure 1 compares tumor 

30 induced mortality with age of animals for those receiving 
ENU alone, and those receiving ENU and promoted with BHT. 
These curves demonstrate that when BHT is given the mean 
age of tumor induced mortality decreases from 
approximately 20 weeks to around 12, and there is also a 

35 decrease in initial latency. These curves are 

significantly different with a confidence limit greater 
than 99.99% using a 2-tailed Cox test. In addition, BHT 
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promotion, while increasing the rate at which tumors 
develop, does not affect the tumor spectrum. 

TRAMPLE 2 

5 nnnogen o Byp-rsssion 

Northern blot analysis revealed elevated levels 
of r-raf-1 , as compared to normal tissue, in every tumor 
examined (Figure 2), and Western blot analysis showed that 

10 protein levels correlated with message levels (U.R. Rapp 
et al , in nn~~T— fencer. S.A. Aaronson et al., 

Eds (Tokyo/VNU Scientific Press, Tokyo, 1987) pp. 55-74). 
in addition, in cell lines derived from primary tumors, 
Raf-1 protein kinase activity was shown by immune-complex 

15 kinase assays to be constitutive. Further analysis of 
other oncogenes revealed no consistent pattern of 
expression except for ras and myc family genes. In the 
case of the myc family, one member (either c-, N-, or L- 
BBB) was overexpressed but never more than one. For the 

20 ras genes, at least one member (Ki-, Ha-, or N-ras ) , and 
often more than one, was expressed at high levels when 
compared with the normal tissue. In addition all 
oncogenes examined via Northern analysis exhibited full 
length, normal sized transcripts. 

25 ^ genes were considered likely candidates for 

mutational activation since oncogenic forms of Ki-ras. have 
previously been observed in lung tumors (S. Rodenhuis et 
al , - — D^. 142, S27-30; T.R. Devereux et 

30 al., r-^mrvrnn-la 12, 299 (1991 )) and ENU is a point 
mutagen (Singer, B . et al., 1983. Mol^irtnr Biology of 
^-. ^ „nd c ^nfnooens. Plenum Press," New York) . A 
systematic analysis of various ras codons known to be 
involved in oncogenic activation was therefore performed . 

35 Ha- Ki-, and N- ras were examined at codons 12, 13, and 
61 for potential mutations via RNAse protection assays 
(R.M. Myers et al. , Scienpe 230, 1242 (1985); B. Winter et 



WO 93/06248 



PCT/US92/07817 



-1 5- 

al., Prnn. Natl- Acad. Sci . USA 82, 7575 (1985)), PCR 
amplification followed by subsequent sequencing (F. Sanger 
et al., .7. Mol. Biol. 13, 373 (1975)), and PCR 
amplification followed by diagnostic restriction digests 
5 (W. Jiang et al., Oncogene 4, 923 (1989)). PCR 

amplification creating diagnostic enzyme sites is a very 
efficient way of examining alleles for mutations at known 
sites and involves designing a PCR primer whose 3' end 
lies next to and produces a novel restriction site 

10 encompassing the codon of interest. Following 

amplification, PCR products from normal alleles will 
contain the new restriction site, while mutant alleles 
will not. Digestion of the product from tissue with two 
normal alleles results in all product being cut; however, 

15 if one allele contains a mutation, only half of the 

product will be digested. Figure 3 shows the results of 
amplification and diagnostic digestion applied to Ki-ras 
codon 12 in several tumors and cell lines. The first 
panel is from a set of lymphomas. F1 is DNA from a normal 

20 untreated mouse and both alleles are cut by BstNl, 

indicating the presence of two normal alleles. MCA5 is a 
murine cell line known to contain a Ki-ras codon 12 
mutation (L.F. Parada et al., Mol. Cell, Biol, 3, 2298 
(1983)), and only the amplified normal allele is cleaved. 

25 Of the five tumors shown in the second panel, one shows a 
mutant Ki-ra& allele. The next panel shows some of the 
lung tumors tested and none of them exhibit a mutant 
allele, and the final panel shows tumor derived cell 
lines. The first three are from lymphomas and the last 

30 three from lung adenocarcinomas. One lung tumor line 

(#117) has a Ki -ras 12 mutation that was not present in 
the primary tumor but came up upon transplantation. This 
analysis has been performed with Ki, Ha and N-ras genes at 
codons 12 and 61. Of all the tumors and cell lines 

35 examined by this method for mutations of the three ras 
genes at codons 12 and 61, the two shown here were the 
only ones detected. Examination of codon 13 was done by 
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PCR amplification of genomic DNA surrounding codon 13 
followed by cloning into KS + (Stratagene) and double 
stranded sequencing- Table I summarizes the ras mutation 
data. The most notable point from this table is the 
5 conspicuous lack of ras mutations in these tumors. In 
fact the number of ras mutations is much lower than would 
be expected for a sampling of spontaneous tumors (S. 
Rodenhuis et al. , Am. Rev. Respir. Pis. 142, S27-30; T.R. 
Devereux et al., ^nnamasl. 12, 299 (1991 ); J.L. Bos, 
10 fencer Res. 49, 4682 (1989)). Having eliminated ras genes 
as playing a primary role in the genesis of these ENU 
induced tumors, c-raf-1 was investigated for possible 
small or point mutations. 

15 TABLE I 

■p ,nrt c e ll T.-ir^s Positive for ras Mutations 

Codon 12 Codon 13 Codon 61 



20 





Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


Ha-ras 
Ki/ras 
N— ras 


0/10 
T/10 
0/10 


0/6 
1/6 
0/6 


0/6 
0/6 
0/6 


0/2 
0/2 
0/2 


0/10 
0/6 
0/10 


0/6 
0/2 
0/6 



* This was a second passage tumor in which the original 
25 tumor did not contain a Ki-ras mutation. 

Table I: Summary of mutation analysis for Ha-, Ki-, and 
N - r as at codons 12, 13, and 61. Each box displays the 
number of mutations detected, over the number of tumors 
30 and tumor derived cell lines examined via RNAse 

protection, sequencing or diagnostic digestion, for each 
of the nine codons. 

re Y AMPLE 3 

35 Mutatio n in Raf-1 

Since no point mutations had been described 
for raf genes in vivo, as had been for the ras genes (E. 
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Santos et al., Science 223, 661 (1984); S. Rodenhuis, 1L. 
Enal. J- Med. 317, 929 (1987); M. Barbacid, Eur. J. 
Clin. Invest. 20, 225 (1990); F. Sanger et al., J. Mol. 
Biol. 13, 373 (1975)), point mutations were screened for 
5 using RNAse protection assays (R.M. Myers et al., 

Science 230, 1242 (1985); E. Winter et al., Proc. Natl. 
Acad. Sci. USA 82, 7575 (1985)). Figure 4 shows a 
typical protection assay using a c-raf-1 probe. In this 
experiment the probe used covered the 3' end of raf-1 

10 from the 3' most StuI site to the end of the coding 

sequence. The first lane is a marker (pBR322 digested 
with Haelll), the second shows the probe alone 
undigested, the third lane shows the probe hybridized to 
unrelated RNA in this case tRNA, the fourth lane shows 

15 hybridization with v- raf transformed cells and the lower 
bands represent cleavage at points where the mouse c- 
raf- 1 gene differs from v- raf . The fifth lane shows 
hybridization with RNA isolated from a normal lung of an 
untreated F1 mouse, the next lanes are RNA isolated from 

20 several tumors. In the case of the normal RNA, only 

one, fully protected, band is detected while in the case 
of the tumors two major bands are seen after digestion. 
20 out of 20 tumors analyzed in this fashion showed this 
extra band. These data demonstrate the following major 

25 points: 1) there is a tumor specific alteration in c- 
raf-1 that results in a region of non-homology 
recognizable by either RNAse A or T1 ; 2) The alterations 
are confined to the same region of one allele as two 
bands of equal size are present in the tumor lanes, and; 

30 3) both alleles were expressed at comparable levels as 

both bands are of approximately equal intensity. In the 
assay shown 5 >xg of poly (A) + RNA was hybridized from 
normal tissue, and 1 >ig was used from the tumors. This 
was necessary to get signals that could be compared on 

35 the same gel due to the overexpression of c-raf-1 in the 
tumors. By running these assays with various markers it 
was possible to estimate the approximate site of the 



WO 93/06248 



PCT/US92/07817 



10 



-1 8- 

alteration(s) to be in the vicinity of the exon 14/exon 
15 junction. In order to define the precise genetic 
alteration or alterations, PCR primers were designed 
which would generate a 600 bp fragment encompassing this 
region. cDNAs from tumor derived RNA were then 
amplified and cloned into KS+ (Stratagene) for double 
stranded sequencing. The sequencing results from 
several tumors are shown in Figure 5 . The top portion 
of Figure 5 presents a cartoon of the mouse Raf-1 
protein. There are three conserved regions CRl , CR2 and 
CR3 with CR3 representing the kinase domain. The probe 
used in the RNAse protection assays covers the indicated 
area, and the PCR primers amplified the bracketed 
region. Sequencing through this area revealed a variety 
15 of mutations just downstream of the APE site. These 

mutants are shown in an expanded version at the bottom 
of Figure 5 (See also SEQ ID N0:1 for normal mouse 
sequence and SEQ ID NO: 2, 3, 4, and 5 for mutant 
sequences). These mutants were isolated from four 
separate tumors, and in each case a normal allele (SEQ 
ID N0:l) was also sequenced. Repeating the cDNA 
synthesis, PCR amplification, cloning and sequencing 
gives the same sequence and normal tissue shows no 
mutations demonstrating that these alterations are not 
artif actual. Sequence covering the amplified region has 
been examined and it is interesting that all of these 
changes occur within a very small region of the raf 
protein. In fact the region where these mutations occur 
overlaps an epitope shared by monoclonal antibodies 
generated against raf (W. Kolch et al., Oncpqene 5, 713 
(1990)), and computer modeling of the protein shows this 
to be a hydrophilic domain, the structure of which is 
predicted to be altered by these mutations. This 
indicates a biologically important region for the 
molecule and indeed the first of these mutation tested 
in NIH3T3 cell assays, after cloning into a retroviral 
expression vector (El-neo, (G. Heidecker et al., MgJ^ 



20 



25 



30 



35 
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Cell. Biol. 10, 2503 (1990))), was found to be weakly 
transforming when driven by a Moloney LTR. The 
transformation efficiency was comparable to EC2, a 
previously characterized mutation of human c-raf-1 cDNA 
(G. Heidecker et al., Mol. Cell. Biol. 10, 2503 (1990); 
C. Wasylyk et al . , Mol. Cell- Biol. 9, 2247 (1989)) and 
-20 fold lower than the v-raf oncogene. 



10 All publications mentioned hereinabove are 

hereby incorporated in their entirety by reference. 

While the foregoing invention has been 
described in some detail for purposes of clarity and 
15 understanding, it will be appreciated by one skilled in 
the art from a reading of this disclosure that various 
changes in form and detail can be made without departing 
from the true scope of the invention and appended 
claims . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Rapp, Ulf R- 

Storm, Stephen M. 

(ii) TITLE OF INVENTION: DETECTION METHOD FOR C-RAF-1 GENES 
(iii) NUMBER OF SEQUENCES: 12 

fiv) CORRESPONDENCE ADDRESS: 

nv; wjt NEEDLE & ROSENBERG, P.C. 

(B) STREET: Suite 400, 133 Carnegie Way, N.W. 

(C) CITY: Atlanta 

(D) STATE: Georgia 

(E) COUNTRY: USA 

(F) ZIP: 30303-1031 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #l.Z5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Perryman, David G. 

(B) REGISTRATION NUMBER: 33,438 

(C) REFERENCE/DOCKET NUMBER: 1414.0421 

fix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (404) 688-0770 

(B) TELEFAX: (404) 688-9880 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
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Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 110 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
180 185 190 

leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 • 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 _ 300 
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Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 

305 310 315 3^0 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 



325 



Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 

340 345 - 30U 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 ■*« 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp lie 

435 440 
Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 

450 455 
lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 

thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 

Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
515 520 ^ 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 

530 535 MU 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 5 50 555 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 



WO 93/06248 



PCT/US92/07817 



-23- 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 110 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 
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Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 iyu 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 Z40 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 tbb 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
r 275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 3ZU 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 330 " s 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 « u 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 

420 425 ^ u 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu lie Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 46D 
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II e His Arg Asp Met Lys Ser Asn Asn lie Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val Val Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly lie Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
15 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 
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Thr Asp Ser Ser Lys Thr Ser Asn Thr lie Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 9° y3 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 11U 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
* 115 120 12» 

Asp Phe Leu Asp His Val Pro lie Thr Thr His Asn Phe Ala Arg Lys 
130 135 1*0 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 15= 1DU 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 !70 1/3 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 iyu 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 ZZ0 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 ^ 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 " 3 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 

Asp Ser Arg Met lie Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
- 310 315 ■S' u 



305 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 330 ■ 3 "" 
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Asd Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
K 340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys lie Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp lie 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn lie 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 
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Leu His Arg Ala Ala His Thr 61 u Asp He Asn Ala Cys Thr Leu Thr 

Thr Ser Pro Arg Leu Pro Val Phe 
645 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 1° lt> 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 

35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 

50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 



115 



Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 

130 135 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 1" ■ 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 1/0 1,0 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
180 185 1SU 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 
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Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 . 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 330 335 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
K 340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys lie Leu Lys Val Val Asp Pro Thr Pro 
370 .375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 
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Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 blU 

Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Ser Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 =°u 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 3/: > 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 sau 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
J 595 600 60b 

Leu Gin His Ser Leu Pro Lys lie Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 OHU 

Thr Ser Pro Arg Leu Pro Val Phe 
645 

(2) INFORMATION FOR SEQ ID NO: 5: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys lie Ser Pro Thr He 
20 25 JU 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 B0 
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Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 110 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 190 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
K 340 345 350 

Thr Arg lie Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 
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His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 4UU 

Arg His Val Asn lie Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu. Gly Leu 
465 4 ?0 

Thr Val Lys lie Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 ^ 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 s 

Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Thr Cys Thr Phe Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 5f>U 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 5i*u 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 
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(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
AACTTGTGGT GGTTGGACCT 20 
(2) INFORMATION FOR SEQ ID N0:7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
TCATCCACAA AGTGATTCTG 20 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CO) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: . 

AGGAGACCAA GTTTCAGATG 20 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GCGTGCAAGC ATTGATATCC 20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1947 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGAGCACA 


TACAGGGAGC 


TTGGAAGACG 


ATCAGCAATG 


/* rTi t p p A PT 
GCI 1 1 bbAL 1 


TAAAGATGCG 


60 


GTGTTTGATG 


GCTCCAGCTG 


CATCTCCCCT 


ACCATTGTTC 


AGCAG1 1 1Gb 


pTATrARrcr 

L 1 M I L.MV3WUL. 


120 


CGGGCCTCAG 


ATGATGGCAA 


GCTCACGGAT 


TCTTCTAAGA 


LAAbLAA 1 MU 


TATCCGGGTT 


180 


TTCTTGCCGA 


ATAAGCAAAG 


GACTGTGGTC 


AATGTGCGGA 


ATPPA ATC AC 
A 1 ubAA 1 brtva 


CTTACATGAC 


240 


TGCCTTATGA 


AAGCTCTGAA 


GGTGAGAGGC 


CTGCAGCCAG 


Au 1 bL 1 b 1 ut 


AGTGTTCAGA 

AAu lull wnwn 


300 


* 

CTTCTCCAGG 


AACACAAAGG 


TAAGAAAGCA 


n f 1 T A ^ A TT 

CGCTTAGATT 


PP A APAPPCA 

bbAALALLbH 


TGCCGCCTCT 

1 uwWUWW Iwl 


360 


CTGATTGGAG 


AAGAACTGCA 


AGTGGATTTT 


TTGGATCATG 


TTPPPATPAP 


AACTCACAAC 


420 


TTTGCTCGGA 


AAACGTTCCT 


GAAGCTTGCA 


_ mi mm jk TA A t± A 

TTCTGTGACA 


TrTrT^AT A A 

TCTb 1 LAbAA 




480 


AATGGATTTC 


GATGTCAGAC 


TTGTGGCTAC 


AAGTTTCATG 


AbCAC 1 b 1 Ab 


TAPrAAAGTA 


540 


CCTACTATGT 


GTGTGGACTG 


GAGTAATATC 


AGACAGCTCT 


TGCTGI 1 I L.L 


AAATTPPACT 


600 


GTTGGTGACA 


GTGGAGTCCC 


AGCACCACCT 


TCTTTCCCAA 


-rr» r»/*Tf*P PAX 

TGCGTCbbA 1 


CPRARAATrT 


660 


GTTTCCCGGA 


TGCCTGCTAG 


TTCCCAGCAC 


AGATACTCTA 


CACCCCATGu 


PTTPAPI 1 IP 


720 


AACACCTCCA 


GCCCTTCCTC 


AGAAGGTTCC 


CTCTCCCAGA 


r KOA f*f*~Tf* 

GGCAGAGGTC 


AAPGTPPArT 


780 


CCCAATGTCC 


ACATGGTCAG 


CACCACCCTG 


CATGTGGACA 


GCAGGATG A I 


TltACRATCrA 


840 


ATTCGAAGTC 


ACAGTGAATC 


AGCCTCACCT 


TCAGCCCTGT 


CCAGCAGCCU 


AAArAAPPTG 
AAML.HMUU 1 13 


900 


GGTCCAACAG 


GCTGGTCACA 


GCCCAAAACC 


CCCGTGCCAG 


CACAAAGAGA 


PPCGGPAfTA 


960 


GGATCTGGGA 


CCCAGCAAAA 


AAACAAAATT 


AGGCCTCGTG 


GGCAGAGAGA 


PTPGAGTTAT 
L 1 Ubnu I Inl 


1020 


TACTGGGAAA 


TAGAAGCCAG 


TGAGGTGATG 


CTGTCTACTC 


GGATCGGGTC 


AGGTTCCT T I 


iuou 


GGCACTGTGT 


ACAAGGGCAA 


GTGGCATGGA 


GATGTTGCAG 


TAAAGATCCT 


AAAGGTGGTT 


1140 


GACCCAACTC 


CAGAGCAACT 


TCAGGCCTTC 


AGGAACGAGG 


TGGCTGTTTT 


GCGCAAAACA 


1200 


CGGCATGTTA 


ACATCCTGCT 


GTTCATGGGG 


TACATGACAA 


AGGACAACCT 


GGCGATTGTG 


1260 


ACTCAGTGGT 


GTGAAGGCAG 


CAGTCTCTAC 


AAACACCTGC 


ATGTCCAGGA 


GACCAAATTC 


1320 


CAGATGTTCC 


AGCTAATTGA 


CATTGCCCGA 


CAGACAGCTC 


AGGGAATGGA 


CTATTTGCAT 


1380 


GCAAAGAACA 


TCATCCACAG 


AGACATGAAA 


TCCAACAATA 


TATTTCTCCA 


TGAAGGCCTC 


1440 


ACGGTGAAAA 


TTGGAGATTT 


TGGTTTGGCA 


ACAGTGAAGT 


CACGCTGGAG 


TGGTTCTCAG 


1500 


CAGGTTGAAC 


AGCCCACTGG 


CTCTGTGCTG 


TGGAT6GCCC 


CAGAAGTAAT 


CCGGATGCAG 


1560 
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GATGACAACC CGTTCAGCTT CCAGTCCGAC GTGTACTCGT ACGGCATCGT GCTGTACGAG 1620 

CTGATGGCTG GGGAGCTTCC CTACGCCCAC ATCAACAACC GAGACCAGAT CATCTTCATG 1680 

GTAGGCCGTG GGTATGCATC CCCTGATCTC AGCAGGCTCT ACAAGAACTG CCCCAAGGCA 1740 

ATGAAGAGGT TGGTGGCTGA CTGTGTGAAG AAAGTCAAAG AAGAGAGACC TTTGTTTCCC 1800 

CAGATCCTGT CTTCCATCGA GCTGCTTCAG CACTCTCTGC CGAAAATCAA CAGGAGCGCC 1860 

TCTGAGCCTT CCCTGCATCG GGCAGCTCAC ACTGAGGACA TCAATGCTTG CACGCTGACT 1920 

ACATCCCCAA GGCTACCAGT CTTCTAG . 1947 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1947 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..1944 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG GAG CAC ATA CAG GGA GCT TGG AAG ACG ATC AGC AAT GGT TTT GGA 48 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

TTC AAA GAT GCC GTG TTT GAT GGC TCC AGC TGC ATC TCT CCT ACA ATA 96 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

GTT CAG CAG TTT GGC TAT CAG CGC CGG GCA TCA GAT GAT GGC AAA CTC 144 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

ACA GAT CCT TCT AAG ACA AGC AAC ACT ATC CGT GTT TTC TTG CCG AAC 192 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

AAG CAA AGA ACA GTG GTC AAT GTG CGA AAT GGA ATG AGC TTG CAT GAC 240 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 
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TGC CTT AT6 AAA GCA CTC AAG GTG AGG GGC CTG CAA CCA GAG TGC TGT 288 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 s:> 

GCA GTG TTC AGA CTT CTC CAC GAA CAC AAA GGT AAA AAA GCA CGC TTA 336 

Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu 

100 105 
GAT TGG AAT ACT GAT GCT GCG TCT TTG ATT GGA GAA GAA CTT CAA GTA 384 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 A " 

GAT TTC CTG GAT CAT GTT CCC CTC ACA ACA CAC AAC TTT GCT CGG AAG 432 

Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys 

130 I 35 
ACG TTC CTG AAG CTT GCC TTC TGT GAC ATC TGT CAG AAA TTC CTG CTC 480 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 

AAT GGA TTT CGA TGT CAG ACT TGT GGC TAC AAA TTT CAT GAG CAC TGT 528 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 

AGC ACC AAA GTA CCT ACT ATG TGT GTG GAC TGG AGT AAC ATC AGA CAA 576 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 

180 185 
CTC TTA TTG TTT CCA AAT TCC ACT ATT GGT GAT AGT GGA GTC CCA GCA 624 
Leu Leu Leu Phe Pro Asn Ser Thr He Gly Asp Ser Gly Val Pro Ala 

CTA CCT TCT TTG ACT ATG CGT CGT ATG CGA GAG TCT GTT TCC AGG ATG 672 

Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 

210 215 "° 

CCT GTT AGT TCT CAG CAC AGA TAT TCT ACA CCT CAC GCC TTC ACC TTT 720 

Pro Val Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 

AAC ACC TCC AGT CCC TCA TCT GAA GGT TCC CTC TCC CAG AGG CAG AGG 768 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 " 3 

TCG ACA TCC ACA CCT AAT GTC CAC ATG GTC AGC ACC ACG CTG CCT GTG 816 
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Ser Thr Ser Thr Pro Asn Val 
260 



His Met Val Ser Thr 
265 



Thr Leu Pro Val 
270 



GAC AGC AGG ATG ATT GAG GAT GCA ATT CGA AGT CAC AGC GAA TCA GCC 864 



Asp Ser Arg Met He Glu Asp 
275 



Ala He Arg Ser His 
280 



Ser Glu Ser Ala 
285 



TCA CCT TCA GCC CTG TCC AGT AGC CCC AAC AAT CTG AGC CCA ACA GGC 912 

Ser Pro Thr Gly 



Ser Pro Ser Ala Leu Ser Ser 
290 295 



Ser Pro Asn Asn Leu 
300 



TGG TCA CAG CCG AAA ACC CCC GTG CCA GCA CAA AGA GAG CGG GCA CCA 960 

Glu Arg 



Trp Ser Gin Pro Lys Thr Pro 
305 310 



Val Pro Ala Gin Arg 
315 



Ala Pro 
320 



GTA TCT GGG ACC CAG GAG AAA AAC AAA ATT AGG CCT CGT GGA CAG AGA 1008 

Arg Gly 



Val Ser Gly Thr Gin Glu Lys 
325 



Asn Lys He Arg Pro 
330 



Gin Arg 
335 



GAT TCA AGC TAT TAT TGG GAA ATA GAA GCC AGT GAA GTG ATG CTG TCC 1056 

Leu Ser 



Asp Ser Ser Tyr Tyr Trp Glu 
340 



He Glu Ala Ser Glu Val Met 
345 350 



ACT CGG ATT GGG TCA GGC TCT TTT GGA ACT GTT TAT AAG GGT AAA TGG 1104 

Lys Trp 



Thr Arg He Gly Ser Gly Ser 
355 



Phe Gly Thr Val Tyr 
360 



Lys Gly 
365 



CAC GGA GAT GTT GCA GTA AAG ATC CTA AAG GTT GTC GAC CCA 

Asp Pro 



His Gly Asp Val Ala Val Lys 
370 375 



He Leu Lys Val Val 
380 



ACC CCA 1152 
Thr Pro 



GAG CAA TTC CAG GCC TTC AGG AAT GAG GTG GCT GTT CTG CGC AAA ACA 1200 



Glu Gin Phe Gin Ala Phe Arg 
385 390 



Asn Glu Val Ala Val 
395 



Leu Arg Lys Thr 
400 



CGG CAT GTG AAC ATT CTG CTT TTC ATG GGG TAC ATG ACA AAG GAC AAC 1248 



Arg His Val Asn He Leu Leu 
405 



Phe Met Gly Tyr Met 
410 



Thr Lys Asp Asn 
415 



CTG GCA ATT GTG ACC CAG TGG TGC GAG GGC AGC AGC CTC TAC AAA CAC 1296 



Leu Ala lie Val Thr Gin Trp 
420 



Cys Glu Gly Ser Ser 
425 



Leu Tyr Lys His 
430 



CTG CAT GTC CAG GAG ACC AAG TTT CAG ATG TTC CAG CTA ATT GAC ATT 1344 
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Leu His 



Val Gin 
435 



Glu Thr Lys Phe Gin Met 
440 



Phe Gin Leu lie Asp He 
445 



GCC CGG CAG ACG 

Ala Arg Gin Thr 
450 



GCT CAG 
Ala Gin 



GGA ATG GAC TAT TTG CAT 



ATC CAT AGA GAC ATG AAA 
Arg Asp 



lie His 
465 

ACA GTG 

Thr Val 

AGT GGT 
Ser Gly 

GCC CCA 
Ala Pro 



Met Lys 
470 



Gly Met Asp Tyr 
455 

TCC AAC AAT ATA 
Ser Asn Asn lie 



Leu His 
460 



GCA AAG AAC ATC 1392 
Ala Lys Asn lie 



TTT CTC CAT GAA GGC TTA 1440 
His Glu 



Phe Leu 
475 



Gly Leu 
480 



AAA ATT 
Lys lie 



GGA GAT TTT GGT 
Phe Gly 



Gly Asp 
485 



TCT CAG CAG GTT 

Ser Gin Gin Val 
500 



GAA CAA 
Glu Gin 



TTG GCA ACA GTA 

Leu Ala Thr Val 
490 

CCT ACT GGC TCT 
Gly Ser 



GAG GTG ATC CGA ATG CAG 
lie Arg 



Glu Val 
515 



Met Gin 
520 



Pro Thr 
505 

GAT AAC 
Asp Asn 



AAC CCA 
Asn Pro 



AAG TCA CGC TGG 1488 

Lys Ser Arg Trp 
495 

GTC CTC TGG ATG 1536 

Val Leu Trp Met 
510 

TTC AGT TTC CAG 1584 
Phe Gin 



Phe Ser 
525 



TCG GAT GTC TAC 

Ser Asp Val Tyr 
530 

GAG CTT CCT TAT 
Pro Tyr 



TCC TAT 
Ser Tyr 



GGC ATC GTA TTG 
Val Leu 



Gly He 
535 



TAT GAA CTG ATG 

Tyr Glu Leu Met 
540 



ACG GGG 1632 
Thr Gly 



Glu Leu 
545 

GTG GGC 
Val Gly 



CGA GGA 
Arg Gly 



TCT CAC ATC AAC 

Ser His lie Asn 
550 

TAT GCC TCC CCA 

Tyr Ala Ser Pro 
565 



AAC CGA 
Asn Arg 



GAT CAG ATC ATC TTC ATG 1680 
lie He 



Asp Gin 
555 



Phe Met 
560 



TGC CCC AAA GCA ATG AAG AGG CTG 

Cys Pro Lys Ala Met Lys Arg Leu 
580 

AAG GAA GAG AGG CCT CTT 

Lys Glu Glu Arg Pro Leu Phe Pro 
595 600 

CTC CAA CAC TCT CTA CCG AAG ATC AAC CGG 

Leu Gin His Ser Leu Pro Lys He Asn Arg 



Val Ala 
585 

TTT CCC CAG ATC 
Gin lie 



GAT CTT AGT AAG 

Asp Leu Ser Lys 
570 

GTA GCT GAC TGT 
Asp Cys 



CTG TCT 
Leu Ser 

AGC GCT 
Ser Ala 



CTA TAT AAG AAC 1728 

Leu Tyr Lys Asn 
575 

GTG AAG AAA GTA 1776 

Val Lys Lys Val 
590 

TCC ATT GAG CTG 1824 

Glu Leu 



Ser lie 
605 

TCC GAG 
Ser Glu 



CCA TCC 1872 
Pro Ser 
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610 615 620 

TTG CAT CG6 GCA GCC CAC ACT GAG GAT ATC AAT GCT TGC ACG CTG ACC 1920 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

ACG TCC CCG AGG CTG CCT GTC TTC TAG 1947 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 110 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 
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Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 13 

Leu Leu Leu Phe Pro Asn Ser Thr He Gly Asp Ser Gly Val Pro Ala 
195 200 20b 

Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 Z20 

Pro Val Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 *W 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 

245 2.50 " 3 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 



305 



310 315 320 

Val Ser Gly Thr Gin Glu Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 330 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 

340 345 JDU 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 • 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Phe Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 4UU 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
s 405 410 415 

Leu Ala lie Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 «u 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 
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Ile His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
51.5 520 525. 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Thr Gly 
530 535 540 

Glu Leu Pro Tyr Ser His He Asn Asn Arg Asp Gin lie He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 
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ymAT IS fTTAIMEP IS : 

1 . A method of identifying an individual at an 
increased risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene of said 
individual; 

analyzing products of said amplification for 
evidence of mutation; and 

classifying an individual having one or more 
mutations in said- region as having an increased risk for 
developing cancer. 

2 The method according to claim 1 , wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
NO: 12. 

3 The method according to claim 2, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO : 1 2 . ^ 

4 The method according to claim 3, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO : 1 2 . 

5. The method according to claim 1, wherein said 
products are analyzed by DNA sequencing. 

6. The method according to claim 1, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR) . 

7. The method according to claim 6, wherein said 
PCR employs a primer comprising SEQ ID NO: 7 and a primer 
comprising SEQ ID NO: 8. 

8. A method for determining a prognosis in 
patients afflicted with cancer, comprising: 
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amplifying a region of the c-raf— 1 gene of said 
individuals- 
analyzing products of said amplification for 
evidence of mutation; and 

classifying a patient having no mutation in said 
region as being less likely to suffer disease relapse or 
having an increased chance of survival than a patient 
having one or more mutations in said region. 

9. The method according to claim 8, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
NO : 1 2 . 

10. The method according to claim 9, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO : 1 2 . 

11. The method according to claim 10, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO : 1 2 . 

12. The method according to claim 9, wherein said 
products are analyzed by DNA sequencing . 

13. The method according to claim 9, wherein said 
amplification is effected using polymerase chain 
reaction (PCR). 

14. The method according to claim 13, wherein said 
PCR employs a primer comprising SEQ ID NO: 7 and a primer 
comprising SEQ ID NO: 8. 

15. A method for determining the proper course of 
treatment for a patient afflicted with cancer, 
comprising: 

amplifying a region of the c-raf-1 gene of said 
patient; 
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analyzing products of said amplification for 
evidence of mutation; 

identifying a patient having at least one mutation 
in said region, which patient may require treatment 
proper for a patient having a lesser chance of survival 
or decreased time to relapse; and 

identifying a patient lacking mutations in said 
region, which patient may require treatment proper for a 
patient having a greater chance of survival or being 
less likely to suffer disease relapse. 

16 The method according to claim 15, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
NO : 1 2 . 

17 The method according to claim 16, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO : 1 2 . 

18 The method according to claim 17, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO : 1 2 . 

19. The method according to claim 16, wherein said 
products are analyzed by DNA sequencing. 

20. The method according to claim 16, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR) - 

21 The method according to claim 20, wherein said 
PCR employs a primer comprising SEQ ID NO: 7 and a primer 
comprising SEQ ID NO: 8. 
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Mo DMA Control 
F1 
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Figure 6 



1. Polymerase Chain Reaction (PCR) Amplification of Target DNA 

Either Genomic DNA or cDNA 

Nucleotide 1307 of coding Nucleotide 1326 of coding 

Primer 1 = 5'- AGGAGACCAAGTTTCAGATG -3' 

Nucleotide 1915 of coding Nucleotide 1896 of coding 

I I 

Primer 2 = 5'- GCGTGCAAGCATTGATATCC -3' 



PCR Cycles: 



94 O C, 1 minute 
55 C, 1 minute 

72 C, 1 minute 
I 



■ 1 

Repeat 

35 cycles 



3. Digestion with 
Sphl and Bglll 



4. Clone into 
c-raf-1 
Containing 
Vector 

i 



Isolate Product 




4. Clone into 
^ Alternate 
Vector 



3. Direct Sequencing 
Asymmetric PCR 



Standard 
Di-Deoxy 
Sequencing 
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